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K.-I. Goh, G. Salvi, B. Kahng, and D. Kim 
School of Physics and Center for Theoretical Physics, Seoul National University, Seoul 151-747, Korea 

(Dated: February 2, 2008) 

We find that the fractal scaling in a class of scale-free networks originates from the underlying tree 
structure called skeleton, a special type of spanning tree based on the edge betweenness centrality. 
The fractal skeleton has the property of the critical branching tree. The original fractal networks are 
viewed as a fractal skeleton dressed with local shortcuts. An in-silico model with both the fractal 
scaling and the scale-invariance properties is also constructed. The framework of fractal networks 
is useful in understanding the utility and the redundancy in networked systems. 
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The emerging, unifying concepts such as the small- 
world property [l( , the scale-free behavior |2| , and the hi- 
erarchical modularity |j| now constitute our basic under- 
standing of the organization of complex networked sys- 
tems, which appear in as diverse examples as the world- 
wide web, the social networks, and the biochemical re- 
action networks inside cells. The small- world property 
refers to the one that the average separation (D) between 
pairs of vertices in the network scales at most logarith- 
mically in the total number of vertices N in the system, 
(D) ~ InN. The scale-free behavior means the lack of 
characteristic scales in the number of links k a vertex 
has, called the degree, manifesting itself in the form of 
a power-law degree distribution Pd(k) ~ fc -7 for large k. 
Recent discovery of fractal scaling and topological self- 
similarity in the world-wide web and the metabolic net- 
works 4], however, raised a new perspective on our view 
of such networked systems. The fractal scaling stands for 
the power-law relation between the minimum number of 
boxes Nb needed to cover the entire network and the size 
of the boxes £b, 

N B (£ B )~£- dB , (1) 

with a finite fractal dimension ds 0- The self-similarity 
here refers to the scale-invariance of the degree distri- 
bution under the coarse-graining with different box sizes 
(length scales) as well as under the iterative application of 
the coarse-graining (the network renormalization) 0, . 
It has been observed, however, that not all networks are 
fractal and the most random network models proposed 
yet are not fractal, either. This poses a fundamental 
question on the origin of the fractal scaling observed in 
the real- world networks 0, 0, El- In this Letter, we 
show that the fractal property of the network can be un- 
derstood from its underlying tree structure. 

While highly entangled as a network looks, a more sim- 
ple structure is embedded underneath it, that is the span- 
ning tree. A spanning tree is a tree composed of A — 1 
edges in a way that they connect all the A vertices in 
the network. Of particular significance is the so-called 
skeleton of a network. The skeleton is a particular 
spanning tree, which is formed by the edges with highest 



betweenness centralities [12J or loads |lj|. The re- 
maining edges in the system are called shortcuts, which 
contribute to forming loops. The skeleton of a scale- 
free network is also scale- free but with different 7. Since 
the betweenness centrality is related to the amount of 
information flow along a given edge, the skeleton can be 
considered as the "communication kernel" of the network 
|lOl | . If a network is organized in a modular way, as it is 
believed to be so for the world-wide web and the biologi- 
cal systems, the inter-modular connections offer commu- 
nication channels across the modules, thus gaining high 
betweenness centralities. By construction, the skeleton is 
composed preferentially of such high-betweenness inter- 
modular connections, which will preserve the modular 
structure while greatly simplifying the complexity. Fur- 
thermore, if the modular structure is distinct enough, 
i.e., there is a rather clear-cut separation between mod- 
ules, we can expect that even a random spanning tree 
can capture the modular structure. Thus by looking at 
the properties of its spanning trees, we can visualize more 
easily the topological organization of the network. 

With the underlying skeleton and random spanning 
tree, here we perform the fractal scaling analysis by mea- 
suring Nr(£r ) fo r several real- world networks and net- 
work models Comparison of the fractal scalings 
in each original network with the corresponding span- 
ning trees reveals distinct patterns according to the pres- 
ence or the absence of fractality in the network. For the 
fractal networks, such as the world-wide web [Fig. 1(a)] 
, the metabolic network of Escherichia coll [Fig. 1 (b)] 
lfij . and the protein interaction network of Homo sapi- 
ens [Fig. 1(c)] 0, the numbers of boxes needed to cover 
the original network and its skeleton are almost the same. 
Moreover, the random spanning tree, while possessing a 
different statistics of Nb, shows nevertheless the same 
fractal dimension ds- This is surprising, because in the 
world-wide web, for example, more than 2N edges (short- 
cuts) are added onto the skeleton (the average degree of 
the world-wide web is 6.7), which is a tremendous num- 
ber in the graph-theoretical sense, and by no means a 
minute perturbation. Such a robustness of fractal scaling 
in the world-wide web shows that even though the net- 
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FIG. 1: (Color) (a— e) Box counting analysis of original networks (o, red)) and their skeleton (V, blue) and random spanning 
tree (A, orange). Shown are cases for the world-wide web (a), the metabolic network of Escherichia coli (b), and the protein 
interaction network of Homo sapiens (c), the Internet at the autonomous systems level as of the year 2004 (d), and the static 
model network with 7 = 2.3 and (k) = 4 (e). In (a-c), we also show two guidelines for the fractal scaling of the original 
network (black) and its random spanning tree (gray). Slope of each guideline is (a) —4.1, (b) —3.5, and (c) —2.3. In (d-e), 
the black line is a fit to the exponential function and the gray one is to a power law. (a'— e') Branching analysis. The mean 
branching number as a function of distance from the root for the skeleton (V, blue) and the random spanning tree (A, orange) 
of the networks in (a-e). For (a'-c'), both the skeleton and the random spanning tree fulfill the criticality condition, (to) = 1 
(horizontal line), as the distance from the root increases, while for (d'-e'), the mean branching number of the skeleton decays 
to zero with no plateau at (m) — 1. 



work is far from being a tree, the shortcuts are distributed 
in a way that they preserve the fractality and modular- 
ity. In other words, shortcuts are mainly present inside 
modules and the connections between different modules 
are largely made through the skeleton. This topologi- 
cal structure can be measured by the fraction of intra- 
branch shortcuts among the total number of shortcuts, 
a branch being the subtree connected to the most con- 
nected vertex. We find that the ratio is 0.78, 0.33, and 
0.45 for Figs. 1(a), 1(b), and 1(c), respectively. On the 
other hand, other networks exhibit different features in 
the fractal scaling analysis. For example, the Internet 
autonomous systems [Fig. 1(d)] or the static model with 
7 = 2.3 and (k) = 4 [Fig. 1(e)], the box counting num- 
ber of the original network and the skeleton decays with 
£b much faster than that of the random spanning tree, 
and the fraction of intra-branch shortcuts is small: 0.087 
for Fig. 1(d) and 0.015 for Fig. 1(e). Also for the social 
networks, such as the actor network and the collabora- 
tion network, the Nb(£b) curves are appreciably differ- 
ent from those of the skeletons, implying that the global 
topology of the social network is highly interwoven on a 
large scale to form a more compact structure [l8|. 

The scaling behavior of the box counting relation 
Eq. (1) in Figs. l(a)-l(c) for the original network and 
its skeleton suggests that the fractal property of the net- 
work originates from that of the skeleton. In addition, 
we argue here that the criticality in the topology of the 



skeleton is required for a network to be a fractal: The tree 
structures such as the skeleton and the random spanning 
tree may be seen as generated through a multiplicative 
branching process starting from a root vertex [19j. At 
each branching step, each vertex born in the previous 
step generates m offsprings with probability b m . The 
criticality condition means the average branching num- 
ber, 

00 

(m) = mbm = 1 - ( 2 ) 

m=0 

Thus the branching tree grows perpetually with off- 
springs neither flourishing nor dying out. In this case, 
when b m ~ m~ 7 , the number of vertices s in the tree 
scales with its linear size t in a power-law form as s ~ t z 
with z = (7 - l)/(7 - 2) for 2 < 7 < 3 and z = 2 for 
7 > 3 0, 12J} , and the tree structure is fractal with frac- 
tal dimension cIb = z. Such a critical branching tree is 
similar in the topological characteristics to the homoge- 
neous scale- free tree network proposed in Ref. |2l|. To 
check the validity of our suggestion, we examine if the 
criticality condition is fulfilled for the skeleton and the 
random spanning trees of the four real-world networks 
and the static model in Figs. l(a')-l(e'). Indeed, for the 
fractal networks [Figs. l(a')-l(c')], both the skeleton and 
the random spanning tree fulfill the criticality condition, 
even though in reality, the dynamic origins of their for- 
mations may well be more complicated than the pure 
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branching dynamics. In our analysis, the root is taken as 
the most connected vertex in the tree. On the other hand, 
for non-fractal networks, the mean number of branches of 
the skeleton decays to zero rapidly as the distance from 
the root increases [see Figs, l(d') and l(e')]. A sim- 
ilar behavior is observed for the actor network as well 
(lif. Thus the actor network is not a fractal. However, 
the random spanning tree satisfies the criticality condi- 
tion in all cases, suggesting the generic fractal structure 
of this kind of trees as shown in Ref. [13 ■ In short, a 
fractal network contains a fractal skeleton underneath it, 
which is perturbed by local shortcuts, thus preserving its 
fractal property. 

Incorporating all the findings so far, we set up a fractal 
network model. The model is based on the multiplica- 
tive branching tree [lflj j . We first introduce an exponent 
7 for the branching probability b m ~ m 1 (m > 1), 
designed to produce the desired power-law degree distri- 
bution Pd{k) ~ k^ 1 with 7 > 2. The branching probabil- 
ity is properly normalized to be critical, i.e., it satisfies 
Eq. (2). Next, a parameter p is introduced to control the 
number of shortcuts added, and hence the mean degree 
of the network. One final parameter q accounts for the 
relative frequency of the local and the global shortcuts. 
The construction of the model network proceeds as fol- 
lows: i) A tree is grown by the multiplicative branching 
rule with branching probability b m . ii) After the branch- 
ing process, every vertex increases its degree by a factor 
p and attempts to make shortcuts to its local neighbors. 
Hi) For each successful shortcut in ii), with probability 
q, we replace it by reconnecting it to a randomly chosen 
vertex, not restricted to its local neighbors. In the latter 
case, we choose the vertex with a weight in proportion 
to its degree in the branching tree, so as to maintain the 
same power-law scaling of the degree distribution. This 
rule is schematized in Fig. 2(e) [l^J- Fig. 2(a) is an il- 
lustrative example of a branching tree of size N = 164 
and 7 = 2.3. The vertices of the tree are colored ac- 
cording to which box they belong to in a particular box- 
covering with £b — 2. For such a tree, even a simple 
graph drawing algorithm, such as Pajek J2§j, can cap- 
ture its inherent hierarchical structure. Fig. 2(b) shows 
the fractal network structure with the addition of local 
shortcuts, where we use p = 0.5 and q — 0. The hier- 
archical modular structure presented in the tree network 
[Fig. 2(a)] persists. On the other hand, the network with 
q = 1 shown in Fig. 2(c), in which the same number of 
shortcuts as in Fig. 2(b) are attached, does not retain the 
modularity. Such an absence of modularity can be readily 
seen in Fig. 2(d), a different layout of the same network 
as Fig. 2(c) generated by Pajek in an unsupervised man- 
ner. Consequently, the fractal property is preserved in 
the case of q = 0, whereas it is not for q = 1, as is clearly 
revealed by the box counting analysis in Fig. 2(f). 

It is noteworthy that there exists an important dis- 
tinction between the present model and other scale-free 




FIG. 2: (a) Uncorreiated scale-free tree with the degree expo- 
nent 7 = 2.3 and the number of vertices JV = 164. It is grown 
by the multiplicative branching rule [l9|l . with the branching 
probability b m ~ m~ 7 . (b) Fractal model network created by 
adding local shortcuts (green) to the tree in (a), (c) A non- 
fractal model network created by adding random shortcuts 
(blue) allowing global connection, (d) The generic configura- 
tion of the network in (c) generated by Pajek, and the absence 
of inherent modular structure. In (b-d), the color of each ver- 
tex is that of the corresponding vertex in (a), (e) Schematic 
illustration of the growth rule of the fractal network model. 
The shaded region indicates the rest of the network generated 
so far. (f ) The fractal scaling analysis of networks with larger 
size N = 311, 043, constructed in the same manner as those 
in (a)-(c). Squares (blue) correspond to the fractal network 
model in (b) with p = 0.5, circles (black) to its skeleton, and 
the diamonds (red) to the non-fractal network model in (c) 
with q = 1. Solid line is the guideline with slope « —2.8. 



trees such as the Barabasi-Albert tree |2j and the geo- 
metrically growing scale- free tree 24]. Such models are 
not fractal, because they do not fulfill the criticality con- 
dition. Indeed, their mean branching rate decreases to 
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zero monotonically as the branching proceeds, without a 
plateau at (m) ~ 1 . Such a type of tree network was 
classified as "causal" trees in Ref. [25j. Note also that 
the fractal trees are not small-worlds. However, with a 
small number of global shortcuts, e.g., q = 0.01 in our 
model with N ~ 3xl0 5 , the network turns into a small 
world. This is seen in the mean box mass versus Ib plot 
in the cluster growing method analysis [l8j |. 

The fractal network model is self-similar. To check it, 
we perform the coarse-graining through the box counting 
method by replacing each box with a single super-node, 
and connecting them if any of their member vertices is 
connected 0. We find that the degree distribution of 
the fractal network model is invariant under the coarse- 
graining by the boxes with different sizes. In addition, 
the degree distribution is invariant under successive ap- 
plications of the coarse-graining transformation ^^|. It 
is interesting to note that some networks are self-similar, 
that is, exhibit the scale-invariant degree distribution, 
yet are not fractal. Typical example of such networks is 
the Internet 0] . So the fractality and the self-similarity 
do not always imply each other in complex networks. 

The framework of fractal network is helpful to under- 
stand, for example, the utility and the redundancy in 
the metabolic networks from a purely topological aspect. 
The high flux backbone in the metabolic network of E. 
coli obtained through the flux balance analysis was shown 
to be composed of many branches with few inter-branch 
connections and to merge into the biomass reaction |26| . 
Obviously, its topological shape resembles the branch- 
ing tree skeleton rooted from a vertex with the largest 
number of connections if the direction in edge is ignored. 
On the other hand, recent in silico flux analysis |27| has 
shown that the metabolic network of E. coli contains high 
density of backup reactions (redundancies) for a given 
condition. Such a reaction is barely used in the normal 
condition, but takes up a high flux when a certain reac- 
tion on the backbone is blocked. When the simultane- 
ous blockade of such a reaction pair blocks the biomass 
production, they are called synthetic lethal Most 
synthetic lethal reactions are located very close to each 
other, being apart in three reaction steps or less along 
the metabolic network. Also they are mostly in the same 
functional category. Thus the reactions with high (low) 
flux in wild type can be regarded as the edges on the 
skeleton (shortcuts) in the framework of the fractal net- 
work. 

The critical branching tree that can be found in various 
phenomena such as earthquake processes, population and 
biological dynamics, epidemics, social cascades, etc., also 
appears in the skeleton of fractal networks, as we found 
here. While the evolving process of the fractal networks 
would be complex and diverse depending on the specific 
systems, the underlying structure, the skeleton, has the 
topology of the critical branching tree. Identifying such 
a simple structure underneath is a step forward towards 



further studies on the renormalization and the universal- 
ity in complex networks. 
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